Hejie Cui

CoMem: Context Management with A Decoupled Long-Context Model

May 29, 2026

EHRBench: An Automated and Reliable EHR-based Benchmark for Clinical Decision Making with LLMs

May 28, 2026

T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

May 04, 2026

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

Feb 27, 2026

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

Jan 30, 2026

WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance

Nov 17, 2025

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs

Jul 03, 2025

MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks

May 26, 2025

Data Foundations for Large Scale Multimodal Clinical Foundation Models

Mar 09, 2025

TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records

Mar 06, 2025